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ABSTRACT 

A fully analytic statistical formalism does not yet exist to describe radio-wavelength 
measurements of linearly polarized intensity that are produced using rotation measure 
synthesis. In this work we extend the analytic formalism for standard linear polariza- 
tion, namely that describing measurements of the quadrature sum of Stokes Q and U 
intensities, to the rotation measure synthesis environment. We derive the probability 
density function and expectation value for Faraday-space polarization measurements 
for both the case where true underlying polarized emission is present within unresolved 
Faraday components, and for the limiting case where no such emission is present. We 
then derive relationships to quantify the statistical significance of linear polariza- 
tion measurements in terms of standard Gaussian statistics. The formalism developed 
in this work will be useful for setting signal-to-noise ratio detection thresholds for 
measurements of linear polarization, for the analysis of polarized sources potentially 
exhibiting multiple Faraday components, and for the development of polarization de- 
biasing schemes. 
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1 INTRODUCTION 

Radio-wavelength observations of linearly polarized syn- 
chrotron emission enable studies of ionized gas and magnetic 
fields in, and along the lines-of-sight to, energetic astrophys- 
ical environments. Faraday rotation measur e (RM) synthe- 
sis (|Burn| [l966; Bre ntiens fc de Bruvn| [2005l is a technique 
for Fourier transforming observational polarimetric data to 
produce a complex Faraday dispersion spectrum. The mag- 
nitude of this spectrum, which we denote by \ J-(a, 8, (j>)\, en- 
capsulates the intensity of linearly polarized emission exhib- 
ited at different Faraday depth^j, <j), along a single physical 
line of sight with sky coordinate (a, 8). 

Statistics describing measurements of linearly polar- 
ized intensity derive d from | J- (a, 8, (j>)\ h ave been investi- 
gated empirically by iGeorge et all |201lf ) and analytically 
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1 Faraday depth is not a physical depth, but rather the depth 
of Faraday rotating magnetised plasma between a source of 
polarized emission and the telescope; see equation (3) from 
iBrentiens fc de Bruvnl j2005l") . 



bv lMacquart et all l |2012f ). However, a fully analytic descrip- 
tion is yet to be presented. Such statistics are required to 
enable detailed quantitative analysis of polarimetric data 
from existent radio facilities such as th e Australia Te l escop e 
Compact Array Broadband Backend ( Wilson et al.l 1201 if ) , 
the Westerbork Synthesis Radio Telescope, the Giant Me- 
trew ave Radio Telescop e, and the Expanded Very Large Ar- 
ray (IPerley et al.ll201lf). a nd from future surveys such as 



POS SUM jGaensler et al 
tory ( Johnston et al .1200; 



2010fl with the AS KAP observa 



Deboer et all2009f ). GALFACTS 



ijTavlor fc Saltej 20101 ') with the Arecibo observatory, and 
the Magnetism Key Science Projec10 with the Low Fre- 
quency Array. 

The statistics exhibited by Faraday-space measure- 
ments of linear polarization are qualitatively similar, yet in 
general quantitatively different, to those of standard linear 
polarization. Intensity measurements of the latter, denotecjfl 



2 http://www.mpifr-bonn.mpg.de/staff/rbeck/MKSP/mksp.html 

3 The term L is commonly used to differentiate linear po- 
larization from the more general elliptical polarization P = 
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by L, are obtained for a given line of sight by taking the 
quadrature sum of measured Stokes Q and U intensities, 
namely 



L(a,5) = vlQMF+FMf. 



(1) 



The statistics of L and discussion of detection thresholds 



are well documented (Rice 1945: Sim mons fc Stewart 1985: 
iLeahv fc Ferninilll989l : IVailrancourtJl2006h . In contrast, in- 



tensity measurements of Faraday-space linear polarization 
must be obtained by first devising a method to extract some, 
or all, of the polarized emission that may be present over 
a range of Faraday depths (0) within the Faraday disper- 
sion spectrum (J r ) for a given line of sight. The specifics of 
this extraction process will dictate the resulting polariza- 
tion measurement statistics. In this work we focus on mea- 
surements produced by extra cting the pealfl intensity from 
a cleaned (He ald et al l 120091 ) Faraday dispersion spectrum, 
denoted by T cln (note that this is not the clean component 
spectrum, but rather the cleaned spectrum which contains 
convolved clean components plus residuals), namely 



r (a, S) = max \ \J rcln {a, S, <f>)\) 



(2) 



where we use the term L RM to differentiate these measure- 
ments from those of standard L. Measurements of L RM , 
as defined in equation are suitable for the analysis of 
data consisting of unresolvecQ polarized emission in Fara- 
day space; such conditions are often encountered observa- 
tionally due to the limited bandwidth capab ilities of many 
present-day telescopes (e.g. iHeald et al.ll2009l ) or the under- 
lying physics of target sources (e.g. pulsars). Like L, L RM 
is positive semi-definite (^ 0) and exhibits non-Gaussian 
statistics. 

In this paper we seek to relate the statistical signifi- 
cance of measurements of L RM with those of L and of stan- 
dard Gaussian statistics for general observational setups, in 
order to facilitate detailed quantitative analysis. To meet 
this aim we analytically, rather than empirically, derive the 
probability density function (PDF) and expectation value 
for measurements of L RM for the general case where true 
underlying polarized emission is present, and for the limit- 
ing case whe re no such em i ssion is present. For comparison, 
we note that lGeorge et all |201l[ ) have presented an empir- 
ical investigation of detection thresholds and the PDF for 
L RM for a specific observational setup; we seek to formally 



y'Q 2 + U 2 + V 2 , which includes Stokes V. While we do not use 
P in this work, we follow the L notation to ensure consistency 
with future deep surveys in which many sources exhibiting both 
L and P emission are likely to be detected. 

4 In practice, the peak should be fitted to minimise pixel discreti- 
sa tion errors; for exam ple, see discussion of 3-point parabolic fits 
bv lHales etldl J2012I) . 

5 Just as the peak surface brightness of an unresolved source 
in a two-dimensional (2D) image (measured in Jy beam - 1 ) is 
equal in magnitude to its integrated surface brightness (or flux 
density; measured in Jy), so too is the peak polarized inten- 
sity of an unresolved component in Faraday space (measured 
in Jy beam -1 RMSF - 1 , where RMSF is the rotation measure 
spread function, or the un it of resolution in Faraday space; 
iBrentiens fc de Bruvr]|2005h equal in magnitude to its Faraday- 
integrated polarized intensity (measured in Jy beam -1 ), modulo 
any statistical or measurement-induced biases. 



generalise these results he re. Additionally, t hroug h compar- 
ison with simulated data, IMacauart et al.l (|2012T ) identified 
a missing correction factor in their analytic PDF that we 
derive here. 

We begin in § [2] by reviewing the existing analytic sta- 
tistical description of L, and by deriving a relationship that 
equates the significance of detections in L with those of 
Gaussian statistics. In § [3] we extend these analytic results 
to L RM , noting two key experiment-specific parameters that 
dictate its observed statistical properties. In §[3] we demon- 
strate use of our derived significance relationships for L and 
L R m through worked examples, and discuss how individual 
lines of sight exhibiting multiple unresolved Faraday-space 
components may be treated. In § [5] we use our analytic re- 
sults to illustrate how cross-sectional profiles for astronomi- 
cal sources with 2D elliptical Gaussian morphologies are af- 
fected when observed in images exhibiting polarization mea- 
surement statistics, namely, where each pixel in an image 
of L or L RM is formed using equation fT} or ©, respec- 
tively. Using these profiles we briefly outline challenges for 
robust source extraction in polariz ation images. In §[6] w e ad- 
dress the point recently raised bv lGeorge et alj (|201lr ) that 
non-Gaussianities in images of Stokes Q and U will com- 
plicate the calculation of robust significance relationships. 
We present our conclusions and comment on future work 
regarding polarization bias in § [7] 

For notational convenience throughout this work we will 
drop the explicit (a, S) notation (cf. equations above), but 
note that the statistics we discuss refer to the distribution 
of intensities that an individual line of sight (or pixel) may 
exhibit. As pointed out in § [21 this is not necessarily the 
same as discussing the statistics of a sample of measurements 
from different lines of sight, or pixel intensities within some 
spatial region of an image. 



2 STANDARD LINEAR POLARIZATION 

The magnitude of observed standard linear polarization, L, 
is given by equation Q. For true underlying Stokes inten- 
sities Q and U in the presence of Gaussian measurement 
errors g q and a v , respectively, the observed Stokes intensi- 
ties Q and U as used in equation ([TJ are given by 



Q = Qo ± ctq , 

U = U ±<Tu. 



(3) 
(4) 



The true underlying and unbiased linearly polarized signal 
is given by 



Lo = VQl + us 



(•») 



Measurements of Stokes Q and U may be obtained at 
radio wavelengths using either individual spectral channel 
observations, resulting in individual measurements of L for 
each observed channel, or band-averaged (e.g. using multi- 
frequency synthesis) observations, resulting in a single mea- 
surement of L for the entire band. Analytically, there is no 
need to differentiate between these two approaches; the an- 
alytic statistical descriptions of L for the individual channel 
and band-averaged approaches are identical; both can be 
described by equations ([3]) and Q for a sample of mea- 
surements with given L a . For completeness, we note that 
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the technique of RM-synthesis is preferable to the band- 
averaged L approach because the latter is more prone to 
bandwidth depolarization, in which rotations of spectral Q 
and U measurements through the complex plane can cause 
their band-averaged values to become diminished. Sepa- 
rately, we note that discussion regarding the statistics of 
stacked measurements of L is beyond the scope of this work; 
for example, as a result of summing multiple L measure- 
ments from individual spectral channels over which Faraday 

rotation may be occurring. 

The PDF for L is given bv lRicd ljl945T l as 



f{L\L ) 



■ exp 



L 2 + Lt 
2a% „ 



LL 



(6) 



where o- QU is a noise term explained below, L 0, Ik{x) 
is a modified Bessel function of the first kind with order 
k and argument x, and it is assumed that the true po- 
larized intensity L 0) is known. In equation ((6]) and 
in future use, we simplify notation by assuming that mea- 
surement error is implicitly specified in all priors; for ex- 
ample, we imply f(L\L ) = f(L\Lo, ctq,u)- Equation ((6} is 
known as the Ricean distribution. Formally, it is only valid 
for (Jq.u — oq = ujj. We assume this to be the case here, 
and discuss issues regarding <tq 7^ a n in Appendix |A"1 The 
Ricean distribution is displayed for several values of the ratio 
Lo/ctq,u in the top panel of Fig. [T] 

The cumulative distribution function (CDF) for L is 
obtained by integrating equation ©, giving 



F(L\L ) = 1 - Qi 



L L 



(7) 



^CTQ,U 0~q,u , 

where Qi(a, /3) is the Marcum Q-function (|Marcumlll948T l. 
Qi(a, 0) is defined by 



Qi(«,/3) = / a; exp 
Jb 



x 1 + a' 



I (a.x) dx , 



(8) 



and may be effic i ently calculated using the algorithm pre- 
sent ed bylsimonl (ll998T l. 

iRice! j 19451 ) gives both the expectation value (E) and 
variance (Var) of equation © as 



E(L|L ) 



1F1 



- 5 ;i; 



Var(L|L ) = L' + 2a 2 QiU - [E(L|L )] 2 , 



, and (9) 
(10) 



where x Fi is a confluent hypergeometric function. In the 
absence of in put signa l (i.e. L — 0) , the Ricean distribution 
limits to the lRavleigbl (|l880h distribution given by 



f(L\L o =0) = 



■ exp 



2a'j 



(11) 



Q,U \ ^ U Q,U. 

where L ^ 0. The expectation value and variance of the 
Rayleigh distribution is given by 



' Q,U 



E(L|L = 0) = 



Var(L|L = 0) = (4 - tt) 



and 



>Q,V 



(12) 
(13) 



The CDF for the Rayleigh distribution is obtained by inte- 
grating equation (|11[) , giving 



F(L\L =0) = 1-exp - 



L 2 



2cr, 



Q,U 



(14) 



Unlike a Gaussian distribution, the Ricean distribution 
is signal (i.e. L ) dependent; the Ricean PDF changes shape 
depending on the magnitude of the underlying input signal 
(in comparison, the shape of a Gaussian distribution is not 
influenced by terms in its PDF that relate to its true or ob- 
served mean). The Ricean distribution is positively skewed 
(right-skewed) and leptokurtic (positive excess kurtosis) for 
weak input signal, while for stronger input signal the dis- 
tribution becomes Gaussian about mean L with standard 
deviation oq, u . It is this signal dependence that prevents 
one from assuming a uniform variance (i.e. from assuming 
that Var(L|L ) from equation (|10p is uniform) for different 
lines of sight that have equal oq.u (for example, a sample 
of spatial pixels with equal oq,u in an image of L, where 
the intensity for each pixel is calculated using equation ([TJ)). 
The signal dependence also co mplicates estimation of L 
given a measurement of L (e.g. Simmons fc Steward 1 19851 ; 
iLeahv fc Ferninill 19891 ; IVaillancourtll2006l ). 

We note that while the PDF for L is non-Gaussian, 
the noise in a Ricean distribution (represented by (Jq.u) is 
Gaussian in character, reflecting the nature of measurement 
uncertainty in Stokes Q and U (such that ctq.lt characterises 
the manner in which random errors are propagated into 
measurements of L). The term Ricean noise therefore has 
the potential to be misleading, as it may incorrectly suggest 
that the Ricean distribution exhibits those properties usu- 
ally associated with regular Gaussian noise, such as signal- 
independence. If the term Ricean noise is used, then the pre- 
fix Ricean should be interpreted in the same way that, for 
example, shot noise, which is governed by Poissonian statis- 
tics and is thus signal-dependent, differentiates itself from 
standard Gaussian noise. 



2.1 Detection Significance 

To quantify the significance of a measurement of L in terms 
of a well- recognised statistic, we relate its probability for 
Type I (false positive) error to that of an equivalent mea- 
surement of intensity in Gaussian noise. We use the term 
"equivalent" to indicate that the same noise term (Tq,u from 
L is used as the standard deviation for the Gaussian dis- 
tribution. We define the signal-to-noise ratio (SNR) of a 
measurement of L as L/ctq,u. This definition makes use of 
the observable quantities L and o~Q tU \ we do not relate L 
to equation JTUJ), which includes the unobservable and In- 
dependent term E(L|L )- We generically denote a measure- 
ment of intensity in Gaussian noise by G (e.g. a measurement 
of Stokes Q intensity), and define the SNR of our equivalent 
Gaussian measurement as G/o QtU . By equating the CDF for 
a Rayleigh distribution [equation (|14fl ] with the standard 
confidence interval for a Gaussian (i.e. erf[|G|/(-\/2<7Q,t/)], 
not its CDF), and by selecting the magnitude of G as the 
appropriate equivalent measure to compare with L, we quan- 
tify the Gaussian equivalent significance, denoted by G ES , 
for a measurement of L as 



\G ES \/a QiU = v^erf-Nl -exp 



1 ( L 



2 \o-q,, 



(15) 



or, conversely, the linear polarization equivalent significance, 
denoted by L ES , for a measurement of G as 
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T ES I 

Li /<Tq,U = 



-2 In 



1 - erf 



\G\ 



(± 

V\/2 Oq.u 



(16) 



where erf and erf -1 are the error function and its inverse, 
respectively. 

The Gaussian equivalent significance relationships 
above may be used to set SNR cutoffs for polarization sur- 
veys, designed to meet the same statistical criteria as stan- 
dard G/a SNR cutoffs in surveys with Gaussian noise. Ex- 
amples illustrating use of these equations are presented in 
§ [4] Equation (|16[1 , and thus implicitly equation (|15|) , is dis- 
played in Fig. [2] 



3 FARADAY-SPACE LINEAR POLARIZATION 

In this section we derive customised statistics to describe 
measurements of L RM obtained using equation ([2j|. We be- 
gin by discussing two experiment-specific parameters that 
will be needed for this derivation: M, which characterises 
an effective sample size in Faraday space, and a RM , which 
characterises the noise in Lj R m- 

RM- synthesis can be thought of as a technique to eval- 
uate J-(<j>) over a range of trial Faraday depths spanning 
i^.,, which may be set b y [equations (35) and (63) from 
iBrentiens fc de Bruvrj[2005| 



V3 



(17) 



where <5(A 2 ) are spectral channel widths in wavelength- 
squared (A 2 ) space for each i'th observed channel; the mini- 
mum <5(A 2 ) characterises the maximum Faraday depth ^ m „ 
at which polarized emission can be detected. The effec- 
tive resolution in Faraday space is [equation (61) from 
IBrentiens fe de Bruvn|[2005l ] 



2%/3 
A(A 2 ) 



(18) 



which is set by the o bserved wavelength-sq u ared range 
A(A 2 ) = \ 2 max ~ X 2 min . IBrentiens fe de Bruynl (|2005r ) note 
that equations (|17[l and (|18[1 assume a top hat weight func- 
tion that is unity between A mi „ and A max and zero else- 
where. In general this will not be the case [cf. equations (|20[) 
and (|2ip presented shortly], requiring both (f) max and ip to 
be determined empirically. For example, ip may be fit with 
a Gaussian; this is analogous to fitting a Gaussian to an 
experiment-specific point spread function in aperture syn- 
thesis imaging. Combining equations (|17[) and (|18[) . we find 
that |.F cta (0)| is effectively comprised of 



M = 



2<p„ 



independent s ample s, 
iGeorge et all (|201ll ) 



(19) 

as was recognise d by both 
and iMacauart et all |2012l ). In 
other words, no more than M statistically independent 
measurements of linearly polarized intensity may be ex- 
tracted from \T cln {(j>)\, assuming T is sampled with at 
least one trial <p per resolution element ip. However, this 
description of M independent samples is only formally 
correct for ideally deconvolved signals. It is not appropriate 
for describing noise, which will consist of M independent 
samples that have been permanently correlated with one 



another, due to the filtering nature of the discrete Fourier 
transform underlying the RM-synthesis technique. The 
presence of such correlations must be addressed to ensure 
a complete statistical description of L RM - Further below, 
we decribe how the noise term a RM may be defined so as 
to account for such correlations, enabling the notion of M 
independent samples to be effectively maintained. We note 
that the statistics describing measurements of \J rcln ((pt)\ 
for some fixed trial Faraday depth <pt (using the subscript t 
momentarily for clarity), are given by those of L from § [2] 
This is because for each trial (j> t , RM-synthesis essentially 
unwraps the observed spectral Q and U data in the complex 
plane so that their band-averaged values may be used to 
compute T{4>t)- Denoting the number of observed spectral 
channels by T, we note that the form of equation (|17|) 
ensures that M > T for T > 1; only for the trivial case 
T = 1 does M = T. 

In this work we assume that T consists of unresolved 
(Faraday-thin) components. We al so assume that T has 
been cleaned (e.g. with RM-CLEAN; iHeald et all 12009) in an 
idealised manner (which may not be met in practise for 
RM spread functions exhibiting strong sidelobes) to pre- 
vent components in the spectrum from being contaminated 
by sidelobes from other components. Therefore, L RM as de- 
fined in equation © can be characterised as the maximum of 
M independent samples within a cleaned Faraday dispersion 
spectrum, each of which exhibits the statistics of L discussed 
in § [21 and each of which may or may not contain any true 
underlying signal L n . For completeness, we note that the 
sequential processing techniques of RM-synthesis and de- 
convolution require signal sparsity in <^>-space (cf. aperture 
synthesis imagin g and the image sparsity requirem ent of the 
CLEAN technique JCornwell, Braun. fe Briggslll999l '). For this 
work, we therefore require that the majority of M indepen- 
dent samples in T are signal-free (i.e. with L = 0). We do 
not consider the analysis of non-sparse Faraday dispersion 
spectra. 

We denote the noise term for L RM by a RM - We define 
this t erm as [note equation (38) from IBrentiens fc de Bruynl 
120051 ] 



0~RM — 



1 T 2 2 

v (EL,-.) 2 



(20) 



where <jQ tUii is the noise in the i'th channel, rj is a correction 
factor described shortly, and Wi are weighting factors for the 
observational data in each i'th channel. For example, the 
channel weights may be chosen using least squares, 



1 



' Q.U.i 



(21) 



noting as in §[2] that the analysis in this section is only for- 
mally valid for o Q ^ Uti = a Q-i — a Vii in each i'th channel; 
see Appendix lAl for discussion regarding a Qti 7^ a Vii . The 
factor 77 in equation ([20} is required to account for correla- 
tions between samples of l^'"^)! at different depths (f>. For 
clarity, we note that if L RM were defined by \J- cln {<f>t)\ for 
fixed 4>t, then rj = 1 would be appropriate because issues re- 
garding selection of the maximum of > 1 correlated samples 
would be inapplicable. Moving on, we assume an experimen- 
tal setup where the total number of trial <f> samples across 
T is given by 
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2</f>„ 



+ 1, 



(22) 



where each sample is spaced apart by S((j>); i.e. a sampling 
rate of k/M per ip. The autocorrelation function is given 
by the magnitude of the RMSF for positive Faraday depths, 
which we denote by \Rh\ = \R[h5((j>) 0]| with integer index 
h — 0, 1, assuming an RMSF with span ±20 max . 
Given positive correlation between samples, as will always 
be the case given \Rh\, estimates of a RM obtained using 
equation (|20p with r) = 1 will always under estimate the true 
value. A correction for this bias is given by I Anderson! |l97ll ; 
see equation (51) in chapter 8, adjusted to represent sample 



variance as 



— Efi-- 

K — 1 7— 1 \ K 



\R h 



(23) 



Using the details above, we now deri ve the PDF for L RM 
in th e context of order statistics (e.g. iDavid fc Nagaraial 
2003), first assuming L = in all M samples, then extend- 
ing to the scenario where 1 of M samples in the Faraday 
dispersion spectrum contains an underlying signal L > 
(i.e. a single Faraday-thin component). We will not ex- 
tend this derivation to the more general scenario in which 
each independent sample in the Faraday dispersion spec- 
trum may have its own independent value of L ^ (i.e. 
multiple Faraday-thin components), though in principle the 
relevant PDF for this situation could be derived using ele- 
ments from the derivations below. However, we do discuss 
detection thresholds for this scenario in § 14.31 Additionally, 
we will not attempt to derive PDFs for fully general scenar- 
ios in which resolved polarized emission in Faraday space 
(i.e. Faraday-thick components) may be presenl(f|. 

For a sample of iV independent and identically- 
distributed variates X 1 ,X 2 ,...,X N ordered such that 
A(i) < X(2) < . . . < A ( jv) (using notation X {j) for ordered 
variates and Xj for unordered variates), then X {h) is known 

(iv) = max(Xj). If X has 



as the fc'th order statistic and X, 



PDF f(X) and CDF F{X), then lDavid fc Nagaraial (|2003l ) 
give the PDF for A (fc) as 



/(*(*)) 



AH 



{F[X (k) ]} k 



(k-l)l(N-k)\{l-F[X m }} 



7=wf l X 



(24) 



Assuming absence of an underlying input signal (L = 
0), the PDF for L RM is derived by substituting equa- 
tions mi) and (Q3) into equation (glj) with N = M and 
k — M, giving 



f(L RM \M, L = 0) = 



„ , Lrm 

M — — exp 



1 — exp 



T 2 



2(7 



RM , 
-\M-1 



2ol 



(25) 



where L RM 1? 0. Equation (|25l) is displayed for several val- 
ues of M in the middle panel of Fig. U as M increases, 
for example as a result of increasing the spectral resolution 
in an experiment, so do the resulting measured intensities. 



e A systematic positive bias in Faraday sp ace, similar t o tha t 
referred to in the image plane as peak bias bv lHales et al. h2012f) , 
will need to be accounted for when measuring the peak polarized 
intensity for resolved (Faraday-thick) sources. 



The expectation valued for equation (|25[) is obtained using 
integration by parts, a Taylor expansion, and term-wise in- 
tegration, giving 



E(L RM \M,L =0) 



0) dL RM 

E [s-i hlf- 1 x 

(26) 



2 

Ml 



(M-sy.si 

Equation (|26[1 limits to equation (|12[) when M = 1. Equa- 
tion (|26|) represents the mean value of L RM that will be 
observed for a line of sight containing no polarized emission 

L = 0). Asymptotically , this mean value grows as ~ Vm M 

David fc Nagaraial I2003T ). 

We now extend our derivation to the scenario where 
M — 1 independent signal- free samples are drawn from equa- 
tion pip and 1 sample with arbitrary L > is drawn from 
equation ©, such that L RM represents the observed maxi- 
mum of these M samples. The distribution for the maximum 
intensity value of M — 1 signal- free samples, which we denote 
L M -i, is derived in the same m anner as equation l| 2 5|), bu t 
with N = k = M - 1. Following IDavid fc Nagaraial l|2003T) . 
the CDF for L RM = max(L M -i, £) is then given by 

F{L RM \M,L )=F(L M _ 1 \M,L o = 0)F(L\L ) 

The PDF for L RM is therefore 

-F(L RM \M,L ) 



(27) 



f(L RM \M, Lo) 



dLjM 

f{L M -x\M, Lo = 0)F(L\L o ) + 
F(L M _ 1 |M, L o = 0)f(L\L o ) 



■ exp 



exp 



r 2 

J-i r, 



2a% 

T 2 

^ R H 

2ai 



(M-l) 



exp 



Qi 



Lq L/RIS 



O RM 0~RM 



2ai 



+ 



exp 



2al 



Io 



LrmLq 



(28) 



where L RM ^ 0. Equation (|28l) is displayed for several val- 
ues of L a /u R m for an M = 30 observing setup in the bottom 
panel of Fig.[T] M — 30 has been selected for illustrative sim- 
plicity, as suitable for a 1.4 GHz observation with 200 MHz 
bandwidth split into 24 spectral channels. The expectation 
value for equation Q28II. 



F>(L RM \L ) 



L R m /{L/ R m\Lo) (1L R m , 



(29) 



7 We note that a derivation o f the expect a tion value for the 
signal-free case is attempted by iHeald et al. I <2009l) , where they 
equate the (M — 1)/M quantile of the CDF with the expected 
value of the largest order statistic. A better approximation i s 
M/(M + 1); see equation (4.5.1) from lDavid fc Nagaraial <|2003h . 
We present the exact solution in equation 126t . 
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Figure 1. Top: The Ricean distribution, equation 10, dis- 
played for several values of true polarization SNR L /a-Q yU . When 
L = the Ricean distribution limits to the Rayleigh distri- 
bution, equation lilt . Middle: The distribution for L RM with 
L = 0, equation H25H , displayed for several values of M, the ef- 
fective number of independent samples in |.F c ' n (0)| as defined by 
equation (1 1 9 D . This panel illustrates how different values of M 
affect the mean value of L RM for lines of sight free from polar- 
ized emission (Lo = 0). When M = 1, equation (1 25 It limits to 
the Rayleigh distribution, equation Hilt . Bottom: The distribu- 
tion for L RM , equation 128P . displayed for the M = 30 case (as 
suitable for a 1.4 GHz observation with 200 MHz bandwidth split 
into 24 spectral channels) for several SNRs L /cr RM . As L /a RM 
increases, equation 1288 limits to the Ricean distribution (dotted 
curves, replicated from top panel). However, unlike a Ricean dis- 
tribution, as L /cr R M — > 0, equation H28H limits to the signal-free 
distribution from equation 1 1251) (dashed curve, replicated from 
middle panel). 



cal characteristics of L RM to be quantified for a range of 
experimental setups. 

Furthermore, our results provide an explanation for the 
discrepancy between the simulated and t heoretical PDFs 
presen ted in the lower panel of Fig. 6 from iMacquart et al.l 
|2012l ). in which the effects of correlation were not consid- 
ered. To demonstrate, we evaluated equation (|23[) for an 
RMSF repre senting an expe r iment al setup similar to that 
described bv lMacquart et al.1 (|2012T ). with 24 x 8 MHz chan- 
nels between 1296 and 1480 MHz, and Faraday space sam- 
pling given by #(</>) = 5 rad m -2 wit h ^ = 4000 rad m~ 2 . 
The result was <frj = 0.935. Thus IMacquart et all (|2012l ) 
overestimated their SNRs by ~ 7%, consistent with their 
observed discrepancjjf]. (Note that the diminished peak den- 
sity in their simulated PDF is accounted for by the Jacobian; 
their green curve is not normalised.) 



3.1 Detection Significance 



Following § 12.11 we quantify the Gaussian equivalent sig- 
nificance, denoted by G R m, for a measurement of L RM by 
equating the CDF for L RM (i.e. F(L RM \M, L = 0), ob- 
tained by integrating equation (|25[) ) with the standard con- 
fidence interval for a Gaussian, giving 



\r< BS 



\/a RM = ^2 err 1 \ 1 



exp 



1 / L RM 

2 V a RM 



.(30) 



Alternatively, equation (|30[) may be rearranged to quantify 
the linear polarization equivalent significance, denoted by 



for a measurement of G, giving 



T ES I 

j - , rm/"RM = 



\ 



-2 In < 1 - 



erf — 



y/2 



(31) 



Examples illustrating use of these equations are presented 
in § SI Equation (|31|) . and thus implicitly equation (|30p . is 
displayed for several values of M in Fig. [2] 



4 EXAMPLES 



In § 14.11 and § 14.21 we construct examples to demonstrate 
use of the signal-free expectation value equations and sig- 
nificance relationships derived for L in § 12.11 and 1j R m in 
13.11 respectively. In § 14.31 we describe how the significance 
relationships for L RM may be used to assess Faraday disper- 
sion spectra comprising multiple unresolved Faraday compo- 
nents. 



does not appear to have an analytic solution; it may be 
evaluated numerically. As with the Ricean distribution, the 
distribution for L RM is signal-dependent, positively skewed, 
and leptokurtic. When the magnitude of L is comparable 
to the noise a RM , the distribution for L RM will approach 
the signal- free case from equation Q25p . For larger L /a R M, 
the distribution will approach the Ricean distribution from 
equation ([6|. 

The results presented in this section provide a theoret- 
ic al explanation for th e empirical curves presented in Fig. 4 
of iGeorge et al.l (|201ll ) for their specific experimental setup. 
Parameterised by M, the equations above enable statisti- 



4.1 Standard Linear Polarization 

The expectation value for a measurement of L for an 
emission-free (L — 0) line of sight is given by equation (|12p ; 
cquivalently, equation (|12[) returns the average observed in- 
tensity for a spatial pixel situated away from real sources in 

8 Separately, we note that Macquart et al. denned noise 

per channel (i.e. <Tq,[7,») as the quadrature sum of o-q^ and <7u,i, 
so that o-Q t/ i = v2aQj for the case cfq^ = a v i . As mentioned 
above and in §[2] a more appropriate definition is uq^ua = <?Q,i = 
cru.i- 
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Figure 2. Detection thresholds for L RM that exhibit equivalent 
Type I (false positive) error rates to those of standard Gaussian 
detections, G. The curves trace equation (13 1 P for several values 
of M, limiting to equation l |16|l for M = 1. 



an image of standard linear polarization, namely ~ 1.25o"Q l( y 
(e.g. see behaviour of dashed curves in Fig. [3]). 

Using equation (|15[) . we find that the detection of a lin- 
early polarized source with SNR L Q U /a QiU = 4.0 is equiv- 
alent in significance to the detection of a ±3.6<tq. u source 
under Gaussian statistics. 

Using equation (|16p or the M = 1 curve from Fig. [21 
we find that a detection threshold of L/a QiU = 5.4 must be 
imposed in order to ensure that polarization detections have 
an equivalent Gaussian significance in excess of ±5.0<jq :U 
(i.e. greater than 99.99994% confidence). 



4.2 Faraday-Space Linear Polarization 

The expectation value for a measurement of L RM for an 
emission- free line of sight is given by equation ()26|) ; its value 
depends on M. Equivalently, equation (|26[) returns the av- 
erage observed intensity for a spatial pixel situated away 
from real sources in an image of peak Faraday-space linear 
polarization, which for an example image with M = 30 is 
found to be ~ 2.78a RM (e.g. see behaviour of solid curves in 
Fig.©. 

We now demonstrate the statistical significance rela- 
tionships for L aM with M = 30 using the same examples 
from g 14.11 

Using equation (|30|) . we find the detection of a source 
with L RM I<Jrm = 4.0 to be equivalent in significance to the 
detection of a ±2.6<tq,u source under Gaussian statistics. 

Using equation (|31[) . we find that a detection threshold 
of L RM I<Jrm = 6.0 is required to ensure equivalent Gaussian 
significance in excess of ±5.0a RM - 



4.3 Multiple Unresolved Faraday Components 

In § [3] we derived the PDF for L RM by assuming that 
\T cln {4>)\ contains no more than a single unresolved Faraday 
component. While derivations of PDFs for polarized inten- 
sity measurements drawn from more complicated Faraday 
dispersion spectra remain beyond the scope of this work, 



we note that the single Faraday component assumption was 
not formally required to derive the significance relation- 
ships presented in § 13.11 Indeed, these relationships are suit- 
able for assessing the Gaussian equivalent significance for 
any number of the available M statistically independent 
measurements in \J rcln ((f>)\. This is because the relation- 
ships in effect benchmark the significance of any observed 
sample against the maximum theoretical noise sample ex- 
pected within \T cln (4>)\. Therefore, equations ([3U)> and (pTTj) . 
as demonstrated in § 14.21 may be used to evaluate the Gaus- 
sian equivalent significance for each candidate Faraday com- 



ponent in \T a ™(0)|; this practice will help to identify noise- 



induced components in co mplex Faraday d ispersion spectra 
(e.g. see data obtained bv lLaw et al.|[201ll v 



5 SOURCE PROFILES IN POLARIZATION 
IMAGES 

Two-dimensional images of linearly polarized intensity for 
Li or Li R m may be formed by calculating equation |T]) or 
([2]), respectively, for each independent spatial pixel. In this 
section we illustrate cross-sectional profiles for astronomi- 
cal sources as observed in images of linear polarization, to 
both demonstrate use of the equations derived earlier, and 
to briefly outline challenges that need to be met for robust 
image-plane source extraction. For demonstration we focus 
on the observation of sources with Gaussian morphologies; 
such sources are typically encountered in radio astronomy 
because of the well-approximated Gaussian nature of tele- 
scope point spread functions. 

In Fig. [3] we trace mean observed spatial profiles 
through Gaussian sources, each with full width at half- 
maximum (FWHM) standardised to unity, that have been 
embedded in images for which the intensities of individ- 
ual spatial pixels exhibit the statistics of either Gaussian 
noise, the distribution for L from equation ©, or the dis- 
tribution for L RM from equation (|28[) with M = 30; we 
assume infinitesimal pixel dimensions so as to ignore pixel 
discretisation effects. The curves displayed in Fig. 0were ob- 
tained analytically using the following approach. First, we 
constructed true underlying SNR profiles for our Gaussian 
sources as spatial functions of fractional FWHM, x, using 



SNR truc (x) = SNR£ u a e k exp 



-4 In (2) 



FWHM 



(32) 



This equation can be used to represent underlying cross- 
sectional profiles for Gaussian sources in linear polarization, 
i.e. L (x), noting that Gaussian profiles in images of Stokes 
Q and U remain Gaussian through equation ©. To obtain 
observed spatial profiles for the Gaussian noise (denoted by 
G), L, and L RM images, we then computed expectation val- 
ues as a function of x for the input signal defined by equa- 
tion (|32[) . For the Gaussian noise profiles, expectation values 
for observed SNRs equal their true underlying SNRs (i.e. ex- 
pectation values in Gaussian noise are signal-independent; 
i.e. (SNR obs (x)) = SNR truc (a;) , Vz). The expectation val- 
ues for L(x) and L RM (x) were computed using equations ((9]l 
and (|29[l . respectively, with L (x) given by equation (|32p . In 
signal-free regions (i.e. left and right of the sources), (L(x)) 
and (L RM {x)) limit to equations (|12|l and (|26[) , respectively. 
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Figure 3. Mean spatial profiles observed for Gaussian sources 
embedded within images exhibiting Gaussian (dotted curves), L 
(dashed curves), and L RM (for M = 30; solid curves) pixel inten- 
sity statistics, displayed for several input true peak SNRs. See §[5] 
for details. 

As SNRp 1 ^ increases, the curves for (L(x)) and (L RM (x)) 
limit to the Gaussian case, the latter more slowly (cf. Fig.[T]). 

Least squares 2D elliptical Gaussian fitti ng routines 
(e.g. the task IMFIT from the MIRIAD package; ISault et all 
1 19951 ) are typically used to extract sources from images ex- 
hibiting Gaussian noise. The polarization profiles in Fig. [3] 
suggest that Gaussian fitting routines may not be appro- 
priate for source extraction in images of linear polarization, 
unless low SNR wings are excluded from the fitting pro- 
cess (e.g. by imposing a SNR cut-off threshold for fitting). 
Additionally, the non-Gaussian distribution of pixel intensi- 
ties about the mean profiles illustrated in Fig. [3] will likely 
cause a systematic positive bias in extracted flux densities, 
particularly for low SNR sources. To address these chal- 
lenges, detailed inspections regarding the accuracy of source 
extraction methods in images of linear polarization are re- 
quired. Suc h analy sis is beyond the scope of this work; see 
lHales et al.l (|2012l ) for an analysis of source extraction in 
linear polarization. 



6 NON-GAUSSIAN NOISE IN Q/U1 

iGeorge et all l|201ll) recently suggested that aperture- 
synthesis imaging and calibration artefacts may introduce 
strong non-Gaussianities into the noise distribution for im- 
ages of Stokes Q and U, which will in turn affect the false 
detection rate of sources in linear polarization. To model 
these non-Gaussianities in Stokes Q and U, they suggested 
use of a compound distribution comprising a Gaussian distri- 
bution plus an exponential distribution; this is known from 
the psychological literature as the E x-Gaussian distribution 
jHohleil 19651 ; [Burbeck fc Lucelll982i ) . 

While it is likely that imaging artefacts will be present 
in images of Stokes Q and U, their influence should be 
largely accounted for in local estimates of rms noise (see e.g. 



lHales et al.ll2012T ). This process will ensure that the distribu- 
tion of pixel SNRs is well described by a Gaussian, in turn 
ensuring that local detection thresholds can be computed 
accurately using the equations presented in this paper. 

In an effort to explain the seemingly non-Gaussian 
distribution e xhibi ted in the lower panel of Fig. 7 from 
IGeorge et al] J201ll) in which Stokes Q data from the NVSS 
|Condon et al.ll 19981) were presented, we focus on two effects. 
We note that IGeorge et all (|201ll ) did not evaluate false de- 
tection rates using the NVSS data itself, but rather a simu- 
lated sky survey described as having characteristics similar 
to the NVSS. Here we examine whether the real NVSS data 
can be used to justify claims of strong non-Gaussian noise. 

The first consideration is the presence of real sources, 
which may be positive or negative in Stokes Q (or U). Such 
sources need to be masked prior t o investigation of the noise 
distribution. IGeorge et all (|201lh masked real total inten- 
sity sources out of the Stokes Q NVSS data investigated. 
We attempted to recover a non-Gaussian distribution using 
NVSS data by investigating a sample of 4° x 4° tiles selected 
to have central positions located along a line of constant 
declination with J2000 5 — +28°; for simplicity we did not 
analyse the full 2326 tiles comprising the NVSS. We selected 

75 of the 90 tiles in this declination range, avoiding 15 tiles 
containing missing pointings in Stokes Q. We supplemented 
this sample with an additional tile, C1232P12, chosen arbi- 
trarily to ensure that at least one tile containing pointings 
with significant amplitude calibration errors was included 
in our analysis. Thus our raw data sample consisted of 76 
tiles. We also investigated a subset of 73 of these tiles fol- 
lowing the removal of tile C1232P12, as well as two other 
tiles, C0432P28 and C0448P28, which were found to contain 
pointings with minor yet distinct calibration errors. The dec- 
lination range above was selected to be representative of the 
NVSS, comprising tiles positioned from the North Galactic 
Pole down to and below the Galactic plane. Tiles near the 
Galactic plane are likely contain large-scale emission unre- 
solved by the NVSS, which may in turn plausibly introduce 
non-Gaussianities into the data due to difficulties encoun- 
tered during deconvolution (e.g. ICornwell. Braun. fc Briggsl 
1999). To obtain Stokes Q images as free from true sources 
as possible, we conservatively masked all pixels that had cor- 
responding Stokes I intensities > mjy beam -1 . The rms 
noise in the NVSS is Oi ~ 0.45 mjy beam -1 in Stokes / and 
<tq w 0.29 mjy beam -1 in Stokes Q. Therefore, we note that 
if masking were only applied to pixels corresponding to cat- 
alogued NVSS sources, namely pixels with I > 4.5a"/, then 
real Stokes Q emission from sources with ~ 20% fractional 
polarization could remain unmasked with significance up to 
Q ~ 1.4o"q, biasing efforts to uncover the underlying noise 
distribution. We then compared histograms of Stokes Q pix- 
els intensities for the unmas ked and masked da ta, and for 
the distribution observed by IGeorge et al.l (120111 1. as shown 
in the upper panel of Fig. U We found that the unmasked 

76 tile data displa yed non-Gauss i an win gs above the NVSS 
levels reported by IGeorge et all l|201lf ). while the masked 
76 tile data displayed wings that fell off more rapidly with 
intensity than their levels. The 73 tile data was found to 
exhibit significantly attenuated wings c ompared with the 76 
tile data. The distribution described bv lGeorge et al.l (|201ll ) 
is inconsistent with the 73 tile data. The difference between 
the unmasked and masked 76 tile data in Fig. [4] is due to 
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Figure 4. Upper panel: Distribution of Stokes Q pixel intensi- 
ties using all pixels (gray points) or only those with correspond- 
ing Stokes / pixel intensities < mjy beam -1 (red points) for 
our full sample of 76 NVSS tiles (76T). The cyan points represent 
the masked distribution for our 73 tile sample (73T), following re- 
moval of 3 tiles containing pointings with easily discerned imaging 
errors. The solid curve is a fitted Gaussian to the 73T points, with 
o-fit = 0.293 mjy b eam -1 . The dashed curve is from the lower 
panel of Fig. 7 from lGeorge et al . ( 2011) for their observed NVSS 
data. Lower panel: Distribution of pixel SNRs corresponding to 
the points in the upper panel, obtained by assuming constant 
cr = Gf lt or using local rms noise estimates a = cr lac . The solid 
curve is a Gaussian with unit variance. The dot-dashed curve, 
obtained empirically, predicts the distribution of pixel SNRs that 
will be observed when rms noise values exhibit 10% error. The 
dashed curve represents the corresponding upper panel curve, 
normalised by am. Note that x-axes are not matched between 
upper and lower panels. 



real sources, while the difference between the masked 76 
and 73 tile data is due to the inclusion of tiles with sig- 
nificant imaging errors. Therefore, the discrepancy bet ween 
the NVSS distribution observed bv I George et al.l |201ll ) and 
the masked 73 tile NVSS distribution examined in this work 
is likely to be due to residual unmasked sources, the inclu- 
sion of corrupted tiles, a combination of both, or some other 
processing error. 

To explain why the masked 73 tile data exhibit small 
non-Gaussian wings, we now consider a second effect that 
may also lead to spurious claims of non-Gaussian noise. It is 
common for rms noise to vary spatially throughout an image 
due to imaging artefacts about strong sources, or intrinsic 
observational features such as primary beam sensitivity. If a 
histogram of pixel intensities is used as a proxy to examine 
the noise distribution within such an image, rather than a 



histogram of pixel SNRs (which require local rms noise es- 
timates), then the inferred noise distribution will appear to 
follow a Gaussian distribution with exponential wings. Sim- 
ilarly, if the distribution of pixel SNRs is examined for these 
images whilst assuming that rms noise is spatially uniform, 
then the inferred noise distribution will again appear to fol- 
low the Ex-Gaussian distribution. In both these examples, 
no intrinsic non-Gaussianities need exist. To demonstrate, 
consider the following illustrative scenario in which an im- 
age is arbitrarily divided into two spatial regions, each rep- 
resented by Gaussian statistics but with a different standard 
deviation. If 99.9% of the image has standard deviation 1 (in 
arbitrary units) and 0.1% has standard deviation 5 (this is a 
crude representation of the fraction of 76 Stokes Q tiles ex- 
hibiting calibration errors), then the observed distribution of 
all pixel intensities will follow an Ex-Gaussian distribution. 
Despite this suggested appearance of non-Gaussianity, the 
distribution of pixel SNRs, obtained using local rms noise es- 
timates, will exhibit purely Gaussian characteristics. Indeed, 
this simplified example further demonstrates that if an ob- 
served distribution of pixel SNRs is not Gaussian (following 
masking of real sources), then the implemented local rms 
noise estimation procedure may not be performing suitably. 

To construct distributions of pixel SNRs for the un- 
masked and masked NVSS data presented above, we 
used the rms estimatio n algorithm implemented within 
the SExtractor package l|Bertin fc Arnout3 :1996; Hol werdal 
2005) to generate a background rms noise map for each 
tile. We set the local mesh size to 24 x 24 pixel 2 , an area 
equivalent to N h = 50 independent resolution elements (see 
lHales et al.ll2012l ). Our estimates of local rms noise therefore 

have uncertainty {[l+0.75/(Af i) -l)] 2 [l-7V i 7 1 ]- 1 } 5 ~ 10% 
(using an approximation to the uncertainty of the stan- 

dard error estimator, suitable for N b > 10; see p. 63 of 
I Johnson fc Kot j ll970T ). or greater if many resolution ele- 
ments in a given mesh contain true sources. We then com- 
puted pixel SNRs using the local rms noise values. For com- 
parison, we also computed pixel SNRs by assuming a spa- 
tially uniform rms noise value for all tiles. This value was 
obtained from a fit to the masked 73 tile pixel intensity 
data, as indicated by the solid curve in the upper panel 
of Fig. [J] The resulting SNR histograms for the unmasked 
and masked NVSS data are presented in the lower panel of 
Fig- SI We found that the distributions constructed using the 
uniform noise level exhibited stronger non-Gaussian wings 
than those constructed using local noise estimates (the for- 
mer are equivalent to the pixel intensity distributions pre- 
sented in the upper panel of Fig. 2]), indicating the presence 
of spatial variations in image sensitivity. This is most clearly 
demonstrated by the masked 76 tile data; the failure of this 
data to exhibit a pure Gaussian distribution when using 
local noise estimates may be predominantly attributed to 
rapid changes in image sensitivity near corrupted pointings, 
where the accuracy of the rms noise estimation algorithm 
employed by SExtractor is diminished. The SNR distribu- 
tion for the masked 73 tile data with local rms noise esti- 
mates (blue dots) was found to closely follow a Gaussian 
distribution, modulo two apparently non-Gaussian features. 
First, neglecting bins with |SNR| > 6, the distribution was 
found to broaden with SNR against that of a true Gaussian. 
This broadening is predominantly due to variance in the rms 
noise estimates used to calculate SNRs, rather than any in- 
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trinsic features of the pixel intensity data. To demonstrate, 
we simulated a distribution of SNRs by drawing samples 
from a Gaussian with unit variance and dividing each sample 
by a noise term that was itself drawn from a Gaussian with 
unit mean and standard deviation 10%. The resulting dis- 
tribution is displayed in the lower panel of Fig. U providing 
a close fit to the observed data. Second, the distribution was 
found to exhibit 18 pixels with jSNR| > 7. We examined the 
NVSS image data to determine the origin of these discrepant 
pixels. We found that 5 of these pixels were situated one 
pixel beyond the masking boundary of a strong total inten- 
sity source, where the total intensity emission was observed 
to drop suddenly to become negative due an adjacent noise 
trough or possibly a cleaning artefact. These Stokes Q pixels 
were thus associated with unmasked real emission. The re- 
maining 13 pixels were situated within pointings exhibiting 
image striping, consistent with calibration errors; addition- 
ally, each of these pointings were situated close the Galactic 
plane, with Galactic longitude ~ 80° and latitude ~ —15°. 
Given these identifications, we conclude that there is no sig- 
nificant evidence for non-Gaussian noise in the Stokes Q 
NVSS data once pointings exhibiting easily discerned cali- 
bration errors have been removed. While we cannot rule out 
the presence of non-Gaussian noise in all surveys, the lack 
of evidence for such noise in the NVSS sample analysed here 
suggests that surveys with more sophisticated data reduc- 
tion are unlikely to be affecte d, and almost certain ly not at 
the strong levels suggested by [George et all (|201ll ). 



7 CONCLUSIONS AND FUTURE WORK 

We have derived customised statistics to describe Faraday- 
space measurements of linearly polarized intensity obtained 
using RM synthesis. The equations presented enable objec- 
tive determination of the significance of polarization detec- 
tions, which will be useful for upcoming surveys of radio 
polarization. 

We found that when the observation-specific parameter 
M was increased, larger detection thresholds were required 
for L aM to ensure that noise features were not mistaken for 
real polarized emission (e.g. see middle panel of Fig. [TJ . We 
found this effect to be exponential; M needed to be increased 
by at least an order of magnitude (e.g. from 10 to 100, or 
1,000 to 10,000) before the new detection threshold required 
to satisfy an original level of statistical significance needed 
to be significantly raised. Therefore, we conclude that it is of 
limited practical use to tailor observational setups to min- 
imise M, unless M can be reduced by at least an order 
of magnitude. A suitable strategy for large- M observations, 
such as those afforded by the M > 10 4 capabilities of fa- 
cilities such as the Australia Telescope Compact Array and 
Expanded Very Large Array, may be to first perform RM- 
synthesis using reduced spectral resolution (i.e. by averag- 
ing spectral channels to reduce T, and in turn M) in order 
to identify faint polarized emission over a reduced ±</> ma:E 
range. 

We also discussed source extraction in polarization im- 
ages and the importance of obtaining spatially-dependent 
rms noise estimates. 

We have not discussed the derivation of confidence in- 
tervals for polarization measurements, the setting of up- 



per limits, or polarization bias. While detailed inspection 
of these issues remains beyond the scope of this work, we 
close by briefly highlighting how our results may be used in 
the future to address each of them, as follows. 

Using the equations developed in this work, credible 
intervals (the Bayesian equivalent of frequentist confidence 
int ervals) may be cons tructed using the technique presented 
by I Vaillancourtl (|2006l ). Similarly, confidence bounds (which 
depend on observed intensities) and upper limits (which do 
not depend on observed intensities, but rather on the de- 
tection process and Type II error minimis ation) may be 
evalua ted using the techniques presented bv lKashvap et all 
(2010); a demonstration using observational data will be pre- 
sented by Hales et al. (in preparation). 

Finally, we note that the different statistics exhibited by 
L and L RM prevent the application of p olarization debiasing 
schemes designed for the fo rmer (e.g. ISimmons &: Steward 
1 19851 ; iLeahv fc Fernin] [l989) from being applied to the lat- 
ter. However, exceptions may be suitable in the limit to 
polarized sources strong enough to display a similar PDF 
in both L RM and L (e.g. compare curves in the lower and 
upper panels of Fig[T]). In general, a polarization bias cor- 
rection scheme designed for L RM would need to be parame- 
terised by a term such as M from equation (|19|) . so as to take 
into account the experiment-specific terms (f> maas and ij). We 
note that the fixed, un-p a ramet erise d debiasing schemes pre - 
sented by iGeorge et all (|201lh and iMacquart et all (|2012T ) 
are therefore limited in applicable experimental scope. Fu- 
ture investigations are clearly required to resolved these is- 
sues. Instead of attempting to correct observed flux densities 
for polarization bias alone, an altern ate approach may be to 
combine this correction with one for lEddingtonl (|l913h bias, 
which like polarization bias is always present. A demonstra- 
tion of this combined technique to remove both polarization 
and Eddington bias will be presented by Hales et al. (in 
preparation). 
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APPENDIX A: UNEQUAL NOISE IN Q/U 

Derivation of the PDF for L with gq 7^ a v requires 
marginalising the Euclidean norm of a bivariate normal dis- 
tribution over position angle, resulting in a complicated an- 
alytic expression that is difficult to utilise and is more easily 
obtained and analysed numerically. Given that g q 7^ a v may 
be encountered in observational data, as highlighted in the 



examples below, in this Appendix we investigate how g q ,u 
may be defined such that the analytic equations presented in 
this work for L and L RM may remain approximately valid. 
While it is not possible to fully model a bivariate normal 
distribution with a single noise term gq iV when g q 7^ g v , 
we note that the only region of PDF parameter space that 
needs to be accurately modelled is that of the noise outlier 
population (i.e. >5<r); in practice, detection thresholds may 
be suitably defined using this population, such that accurate 
modelling of the remaining parameter space is not required. 
We therefore focus here on obtaining a rudimentary defini- 
tion for oq,u that, under certain conditions, may facilitate 
use of the analytic relationships presented in this work. 

Two examples of data exhibiting g q 7^ o v are as fol- 
lows. First, consider images of Stokes Q and U in which a 
polarized source is present with signal Q 7^ U , as will be 
the case in general. Following deconvolution, it is possible for 
residual sidelobes and other artefacts about strong sources 
to affect one image more than the other, causing some lines- 
of-sight to exhibit g q 7^ a v . As a second example, spatial 
variations in root-mean-square (rms) noise may be present 
and independently-positioned throughout images of Stokes 
Q and U, where beam-sized noise elements are superposed 
on larger-scale undulations. For example, undulations in rms 
noise may be produced in aperture synthesis images by large 
scale emission that is unrecoverable by deconvolution algo- 
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rithms such as CLEAN IjCornwell. Braun. fe B riggs 1999J) , or 
in general radio imaging through insufficient flagging of data 
affected by radio frequency interference. Even surveys de- 
signed to exhibit spatially uniform rms noise, such as the 
NVSS (I Condon et al.llT998l ). exhibit undulations in rms noise 
due to a combination of the effects described above, thus en- 
abling some lines of sight to exhibit gq 7^ g v . Though the 
issues above may be mitigated by telescope design, observ- 
ing strategy, and data processing, the potential remains for 
lines-of-sight to exhibit gq 7^ g v . Additionally, and in a triv- 
ial sense, uncertainties in the estimator used to evaluate gq 
and g v (using hat notation here to indicate standard errors 
rather than true underlying standard deviations) will result 
in <jq 7^ ctq and g v 7^ g v , such that g q 7^ a v may result 
in situations where g q — a v . For example, if a mesh con- 
taining N b < 100 independent resolution elements is used 
to estimate the local standa rd error in an ima ge (e.g. as 
demonstrated in Stokes I by iHuvnh et al.l [20051) , then the 
uncertainty in this estimator will be > 7% (using the for- 
mula referenced in-text in § [6} . Given this and the examples 
above, how should oq,u be defined? Ideally, when a Q 7^ a v , 
the use of <Tq,u should be avoided altogether and all analysis 
should be conducted numerically to correctly utilise the true 
PDF. However, this may be cumbersome for typical situa- 
tions where &q and g v are within a factor of, say, ~ 10%. 

We begin by focusing on L and noting that in situations 
where g q = a v , a suitable definition for ctq.u may be given 
by 0.5(o"q + &u); this solution is more precise (i.e. exhibits 
less dispersion about the true standard deviation) than as- 
signing ctq_[/ = gq (or gq.u — g v ), because the probability 
of gq overestimating the true standard deviation is greater 
than the probability of both g q and g v overestimating it. 
Similar performance may be obtained b y defining gq ,u fol- 
lowing first-order error propagation (e.g. Clifford 1973) eval- 
uated about the point (Qo, U ), 
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We note that neglection of higher-order terms in the equa- 
tion above is formally incorrect (such terms are important 
when noise dominates signal); however, our rudimentary in- 
terest here regards assessment of the general first-order form 
of equation (|A1[) . rather than its detailed quantitative prop- 
erties. Equation (|A1|) indicates that a suitable first-order 
form for a QiU may be defined by 



(A2) 



with positive factors A Q and A v satisfying Aq + Au = 1, 
and where these factors may be assumed to be constants 
(i.e. independent of Q and U ) to ensure that a Q]U remains 
signal-independent (note comments at the end of § To 
investigate whether the use of equation (|A2|) . or the average 
standard error definition further above, could enable out- 
liers within empirically-obtained a Q 7^ a v distributions to 
be suitably characterised by the analytic distributions for L 
and L RM , we performed the following simulations. 

We populated discrete distributions of L/a QtU (dimen- 
sionless) for 1 ^ ctq/ctu ^ 1-5 and examined how closely 
outliers beyond ~ 5a were fit by equation (|11[) . Three def- 
initions for (Jq,u were tested: 0.5(a Q + a v ), equation (|A2|) 
with A Q = Au = 0.5, and a case where A Q and Au were 
varied in search of optimal values. We set up our simula- 
tions in two ways. In the first setup, we assumed that the 
variance in standard error estimates was zero, namely that 
&q = <tq and au — au- We found that selecting A Q = 1 or 
Au = 1 over- or underestimated the noise required to cor- 
rectly model true outlier populations, respectively. Selecting 
these values in turn under- or overestimated the true statis- 
tical significance of outliers, respectively. We therefore note 
that if we conservatively defined a Q , v = max (aQ, au), then 
the statistical significance of outliers, as well as that of true 
polarized sources, would never be overestimated (though 
they would certainly be underestimated). To limit the degree 
to which the significance of polarization detections (both 
signal and noise) could be underestimated, whilst still gen- 
erally preventing them from being overestimated, we found 
the following empirical values to be suitable for use in equa- 
tion (IA2I): 



Ac 



0.8 
0.2 



if a Q > a v 
if a Q < a v 



(A3) 



with Au = 1 — Aq. We found that use of these factors 
limited systematic underestimation of the statistical sig- 
nificance of G ES w 5a outlier samples to < 2% of their 
true, empirically determined values; this underestimation 
peaked at a Q /au ~ 1.2, diminishing elsewhere within the 
1 ^ a Q /au ^ 1.5 range tested. For comparison, we found 
that the conservative definition aQ tU = max (ctq, au) per- 
formed worse, typically underestimating true significance 
values systematically by ~ 6%. In the second simulation 
setup, we investigated the effects of uncertainties in <tq and 
a v by assuming what we envisaged to be ~worst-case 10% 
errors in each. The factors from equation (|A3|) were again 
found to be suitable in these simulations. 

The results of our L simulations are summarised in 
Fig. I All for the three a QiU definitions tested, and for three 
noise cases: a Q = au, a Q = l.lau, and a Q — 1.5au- We 
found that introducing variance into the standard error es- 



timates affected the empirical L distributions in a manner 
similar to the influence of introducing a Q 7^ a v , shifting 
outlier populations away from the analytic PDFs. We found 
that the 0.5(<t q + au) and Aq = A v = 0.5 definitions for 
ctq,u resulted in empirical distributions that were closely 
fit by equation pip at low-SNRs, but which resulted in in- 
creasingly poor fits for outlier populations as either variance 
was introduced to the standard error estimates or a Q 7^ au 
was introduced. The Aq = Au = 0.5 definition resulted in 
marginally improved fits for noise outliers compared with 
the average standard error definition. The third definition, 
using equation ()A2|) with factors from equation (|A3|) . re- 
sulted in empirical distributions most accurately fit by equa- 
tion (jlll) for outliers, though at the expense of poor fitting 
at low-SNR. We found that this definition of a Q ,u optimally 
accounted for unequal noise within the range tested, even 
when variance in standard error estimates was introduced 
(though less so towards the upper end of the range where 
a Q /a v = 1.5). 

We performed similar simulations for L RM to investi- 
gate suitability of the oq^u definitions considered above in 
this different statistical environment. We simplified the po- 
tential complexity of these simulations by assuming the fol- 
lowing illustrative ~worst-case setup based on the M — 30, 
24 spectral channel setup described in § [3] Individual sam- 
ples of L RM were obtained empirically by selecting the max- 
imum of M = 30 independent Rayleigh-distributed variates. 
Each of these variates was assumed to represent a uniformly- 
weighted stack of 24 spectral channels, namely with equa- 
tion (|21|l set to unity. We investigated unequal noise by 
systematically assuming a Qti = a v ,i, a Q:i = l.la U:i , or 
a Qii = 1.5<Tu,i for each i'th channel, and investigated the 
influence of uncertainties in estimates of standard error by 
introducing 10% error to a Qii and a v ,i for each channel (thus 
10/v24% in each of the 30 x 2 stacked Q and U values used 
to obtain each sample of L RM )- 

The results of our L RM simulations are displayed in 
Fig. IA2I We found similar behaviour of the different a Q U 
definitions to that exhibited in Fig. IA1I However, the effect 
of introducing unequal noise was found to be more promi- 
nent within outlier populations in Fig. lA2l due to their being 
selected from a maximum of M — 30 samples. This effect 
will be further pronounced for data with larger M, though 
because it is unlikely that all spectral channels will exhibit 
the systematic unequal noise assumed in our worst-case sim- 
ulations, the outlier populations in real data are unlikely to 
be as extreme as those presented here. Additionally, if chan- 
nel weighting is introduced according to equation (|21l) . the 
the impacts of bad channel data will be minimised and out- 
lier populations further reduced. Regarding the introduc- 
tion of variance in a Q ^ and a u ,i, we note that its impact 
on outlier populations will be minimised for data with a 
greater number of spectral channels, due to the 1/vT depen- 
dence in stacked channel data. As with the simulations for 
L, we found that by defining a QiU using equation (|A2|) with 
the factors from equation (|A3|) . outlier populations of the 
various empirical distributions investigated were aligned ro- 
bustly with the analytic distribution given by equation (1251) . 

We conclude that the analytic equations presented in 
the main body of this paper may be utilised for data exhibit- 
ing modest unequal noise of ~ 10%, without incurring sig- 
nificant biases, by defining a Q , v according to equation (| A2|l 
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Figure Al. Empirical PDFs for L obtained for three noise cases (coloured curves), in which each L sample was normalised by ctq.u 
in accordance with the three definitions indicated above the columns. For comparison, the analytic PDF for L given by equation Hilt is 
presented identically in each panel; it is not fit to the data. Data from panels in the first and third rows are displayed with logarithmic 
scaling in the second and fourth rows, respectively. The upper set of panels are for the simulations with known standard errors, namely 
for <7q = <tq and <j v = cry. The lower set of panels are for the simulations with 10% errors in ctq and atu- Small- number statistics begin 
to cause artificial broadening of the empirical distributions below densities of ~ 10 -5 . 



with factors from equation (|A3|) . While our simulations have 
indicated that this definition may be suitable for data ex- 
hibiting more extreme unequal noise, we caution that we 
have not considered the potential impacts of biases on po- 
larization position angles; future investigation regarding this 
issue is clearly required. 



This paper has been typeset from a TjnjX/ I^Tj^X file prepared 
by the author. 
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Figure A2. Empirical PDFs for L RM compared with the analytic PDF given by equation J25j; see text for details. The layout follows 
that described for Fig. IA1I 



